Dataset statistics
| Number of variables | 23 |
|---|---|
| Number of observations | 3900 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.3 MiB |
| Average record size in memory | 356.8 B |
Variable types
| NUM | 19 |
|---|---|
| CAT | 3 |
| BOOL | 1 |
Reproduction
| Analysis started | 2020-05-10 01:03:10.317732 |
|---|---|
| Analysis finished | 2020-05-10 01:03:58.755557 |
| Version | pandas-profiling v2.6.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
Time_Room_Service is highly correlated with Deposit_Kept | High Correlation |
Deposit_Kept is highly correlated with Time_Room_Service | High Correlation |
Room has 146 (3.7%) zeros | Zeros |
Check-in/Check-out has 208 (5.3%) zeros | Zeros |
F&B has 173 (4.4%) zeros | Zeros |
Entertainment has 84 (2.2%) zeros | Zeros |
Deposit_Kept has 2167 (55.6%) zeros | Zeros |
Time_Room_Service has 2158 (55.3%) zeros | Zeros |
| Distinct count | 3900 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16412.674871794872 |
|---|---|
| Minimum | 10007 |
| Maximum | 22996 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 30.6 KiB |
Quantile statistics
| Minimum | 10007 |
|---|---|
| 5-th percentile | 10610.9 |
| Q1 | 13171.5 |
| median | 16431.5 |
| Q3 | 19648.25 |
| 95-th percentile | 22273.25 |
| Maximum | 22996 |
| Range | 12989 |
| Interquartile range (IQR) | 6476.75 |
Descriptive statistics
| Standard deviation | 3725.011331 |
|---|---|
| Coefficient of variation (CV) | 0.2269594298 |
| Kurtosis | -1.178374853 |
| Mean | 16412.67487 |
| Median Absolute Deviation (MAD) | 3216.357923 |
| Skewness | 0.003653074544 |
| Sum | 64009432 |
| Variance | 13875709.42 |
| Value | Count | Frequency (%) | |
| 10239 | 1 | < 0.1% | |
| 21111 | 1 | < 0.1% | |
| 18536 | 1 | < 0.1% | |
| 10932 | 1 | < 0.1% | |
| 17069 | 1 | < 0.1% | |
| 10924 | 1 | < 0.1% | |
| 12971 | 1 | < 0.1% | |
| 17065 | 1 | < 0.1% | |
| 12967 | 1 | < 0.1% | |
| 17061 | 1 | < 0.1% | |
| Other values (3890) | 3890 | 99.7% |
| Value | Count | Frequency (%) | |
| 10007 | 1 | < 0.1% | |
| 10009 | 1 | < 0.1% | |
| 10015 | 1 | < 0.1% | |
| 10016 | 1 | < 0.1% | |
| 10017 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 22996 | 1 | < 0.1% | |
| 22995 | 1 | < 0.1% | |
| 22993 | 1 | < 0.1% | |
| 22990 | 1 | < 0.1% | |
| 22987 | 1 | < 0.1% |
Gender
Categorical
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 30.6 KiB |
| Female | |
|---|---|
| Male |
| Value | Count | Frequency (%) | |
| Female | 2032 | 52.1% | |
| Male | 1868 | 47.9% |
Length
| Max length | 6 |
|---|---|
| Mean length | 5.042051282 |
| Min length | 4 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 4 | 66.7% | |
| Uppercase_Letter | 2 | 33.3% |
| Value | Count | Frequency (%) | |
| Latin | 6 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 6 | 100.0% |
Frequent_Traveler
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 30.6 KiB |
| 1 | |
|---|---|
| 0 |
| Value | Count | Frequency (%) | |
| 1 | 3200 | 82.1% | |
| 0 | 700 | 17.9% |
Age
Real number (ℝ≥0)
| Distinct count | 73 |
|---|---|
| Unique (%) | 1.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 39.91692307692308 |
|---|---|
| Minimum | 7 |
| Maximum | 80 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 30.6 KiB |
Quantile statistics
| Minimum | 7 |
|---|---|
| 5-th percentile | 15 |
| Q1 | 27 |
| median | 41 |
| Q3 | 52 |
| 95-th percentile | 64 |
| Maximum | 80 |
| Range | 73 |
| Interquartile range (IQR) | 25 |
Descriptive statistics
| Standard deviation | 15.26546393 |
|---|---|
| Coefficient of variation (CV) | 0.3824308778 |
| Kurtosis | -0.7517269127 |
| Mean | 39.91692308 |
| Median Absolute Deviation (MAD) | 12.74626746 |
| Skewness | -0.03040519735 |
| Sum | 155676 |
| Variance | 233.0343891 |
| Value | Count | Frequency (%) | |
| 45 | 114 | 2.9% | |
| 52 | 111 | 2.8% | |
| 39 | 107 | 2.7% | |
| 23 | 98 | 2.5% | |
| 40 | 98 | 2.5% | |
| 25 | 96 | 2.5% | |
| 24 | 96 | 2.5% | |
| 41 | 93 | 2.4% | |
| 22 | 92 | 2.4% | |
| 46 | 88 | 2.3% | |
| Other values (63) | 2907 | 74.5% |
| Value | Count | Frequency (%) | |
| 7 | 22 | 0.6% | |
| 8 | 20 | 0.5% | |
| 9 | 14 | 0.4% | |
| 10 | 22 | 0.6% | |
| 11 | 28 | 0.7% |
| Value | Count | Frequency (%) | |
| 80 | 6 | 0.2% | |
| 79 | 1 | < 0.1% | |
| 78 | 2 | 0.1% | |
| 77 | 5 | 0.1% | |
| 76 | 4 | 0.1% |
Type
Categorical
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 30.6 KiB |
| Business travel | |
|---|---|
| Personal Travel |
| Value | Count | Frequency (%) | |
| Business travel | 2695 | 69.1% | |
| Personal Travel | 1205 | 30.9% |
Length
| Max length | 15 |
|---|---|
| Mean length | 15 |
| Min length | 15 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 11 | 73.3% | |
| Uppercase_Letter | 3 | 20.0% | |
| Space_Separator | 1 | 6.7% |
| Value | Count | Frequency (%) | |
| Latin | 14 | 93.3% | |
| Common | 1 | 6.7% |
| Value | Count | Frequency (%) | |
| ASCII | 15 | 100.0% |
Flight_Class
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 30.6 KiB |
| Business | |
|---|---|
| Eco | |
| Eco Plus | 273 |
| Value | Count | Frequency (%) | |
| Business | 1849 | 47.4% | |
| Eco | 1778 | 45.6% | |
| Eco Plus | 273 | 7.0% |
Length
| Max length | 8 |
|---|---|
| Mean length | 5.720512821 |
| Min length | 3 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 8 | 66.7% | |
| Uppercase_Letter | 3 | 25.0% | |
| Space_Separator | 1 | 8.3% |
| Value | Count | Frequency (%) | |
| Latin | 11 | 91.7% | |
| Common | 1 | 8.3% |
| Value | Count | Frequency (%) | |
| ASCII | 12 | 100.0% |
Points
Real number (ℝ≥0)
| Distinct count | 2358 |
|---|---|
| Unique (%) | 60.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1964.1866666666667 |
|---|---|
| Minimum | 50 |
| Maximum | 6537 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 30.6 KiB |
Quantile statistics
| Minimum | 50 |
|---|---|
| 5-th percentile | 344.95 |
| Q1 | 1332 |
| median | 1881.5 |
| Q3 | 2531 |
| 95-th percentile | 3796 |
| Maximum | 6537 |
| Range | 6487 |
| Interquartile range (IQR) | 1199 |
Descriptive statistics
| Standard deviation | 1021.563774 |
|---|---|
| Coefficient of variation (CV) | 0.5200950559 |
| Kurtosis | 0.3189905056 |
| Mean | 1964.186667 |
| Median Absolute Deviation (MAD) | 791.3708034 |
| Skewness | 0.4855762506 |
| Sum | 7660328 |
| Variance | 1043592.545 |
| Value | Count | Frequency (%) | |
| 1487 | 7 | 0.2% | |
| 1874 | 7 | 0.2% | |
| 1653 | 7 | 0.2% | |
| 1805 | 6 | 0.2% | |
| 1765 | 6 | 0.2% | |
| 1853 | 6 | 0.2% | |
| 1812 | 6 | 0.2% | |
| 1940 | 6 | 0.2% | |
| 2374 | 6 | 0.2% | |
| 1622 | 5 | 0.1% | |
| Other values (2348) | 3838 | 98.4% |
| Value | Count | Frequency (%) | |
| 50 | 1 | < 0.1% | |
| 55 | 2 | 0.1% | |
| 58 | 1 | < 0.1% | |
| 62 | 1 | < 0.1% | |
| 63 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 6537 | 1 | < 0.1% | |
| 5816 | 1 | < 0.1% | |
| 5776 | 1 | < 0.1% | |
| 5722 | 1 | < 0.1% | |
| 5693 | 1 | < 0.1% |
| Distinct count | 6 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.8482051282051284 |
|---|---|
| Minimum | 0 |
| Maximum | 5 |
| Zeros | 146 |
| Zeros (%) | 3.7% |
| Memory size | 30.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.396156509 |
|---|---|
| Coefficient of variation (CV) | 0.4901881873 |
| Kurtosis | -0.9458255092 |
| Mean | 2.848205128 |
| Median Absolute Deviation (MAD) | 1.177744116 |
| Skewness | -0.09536277055 |
| Sum | 11108 |
| Variance | 1.949252997 |
| Value | Count | Frequency (%) | |
| 2 | 886 | 22.7% | |
| 4 | 862 | 22.1% | |
| 3 | 849 | 21.8% | |
| 1 | 611 | 15.7% | |
| 5 | 546 | 14.0% | |
| 0 | 146 | 3.7% |
| Value | Count | Frequency (%) | |
| 0 | 146 | 3.7% | |
| 1 | 611 | 15.7% | |
| 2 | 886 | 22.7% | |
| 3 | 849 | 21.8% | |
| 4 | 862 | 22.1% |
| Value | Count | Frequency (%) | |
| 5 | 546 | 14.0% | |
| 4 | 862 | 22.1% | |
| 3 | 849 | 21.8% | |
| 2 | 886 | 22.7% | |
| 1 | 611 | 15.7% |
| Distinct count | 6 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.9956410256410257 |
|---|---|
| Minimum | 0 |
| Maximum | 5 |
| Zeros | 208 |
| Zeros (%) | 5.3% |
| Memory size | 30.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.519719563 |
|---|---|
| Coefficient of variation (CV) | 0.5073103052 |
| Kurtosis | -1.049238343 |
| Mean | 2.995641026 |
| Median Absolute Deviation (MAD) | 1.278756345 |
| Skewness | -0.2843441178 |
| Sum | 11683 |
| Variance | 2.30954755 |
| Value | Count | Frequency (%) | |
| 4 | 929 | 23.8% | |
| 5 | 777 | 19.9% | |
| 3 | 720 | 18.5% | |
| 2 | 656 | 16.8% | |
| 1 | 610 | 15.6% | |
| 0 | 208 | 5.3% |
| Value | Count | Frequency (%) | |
| 0 | 208 | 5.3% | |
| 1 | 610 | 15.6% | |
| 2 | 656 | 16.8% | |
| 3 | 720 | 18.5% | |
| 4 | 929 | 23.8% |
| Value | Count | Frequency (%) | |
| 5 | 777 | 19.9% | |
| 4 | 929 | 23.8% | |
| 3 | 720 | 18.5% | |
| 2 | 656 | 16.8% | |
| 1 | 610 | 15.6% |
| Distinct count | 6 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.856923076923077 |
|---|---|
| Minimum | 0 |
| Maximum | 5 |
| Zeros | 173 |
| Zeros (%) | 4.4% |
| Memory size | 30.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.442597743 |
|---|---|
| Coefficient of variation (CV) | 0.5049480521 |
| Kurtosis | -0.9986324122 |
| Mean | 2.856923077 |
| Median Absolute Deviation (MAD) | 1.220109665 |
| Skewness | -0.1208369828 |
| Sum | 11142 |
| Variance | 2.081088247 |
| Value | Count | Frequency (%) | |
| 4 | 842 | 21.6% | |
| 3 | 825 | 21.2% | |
| 2 | 815 | 20.9% | |
| 1 | 639 | 16.4% | |
| 5 | 606 | 15.5% | |
| 0 | 173 | 4.4% |
| Value | Count | Frequency (%) | |
| 0 | 173 | 4.4% | |
| 1 | 639 | 16.4% | |
| 2 | 815 | 20.9% | |
| 3 | 825 | 21.2% | |
| 4 | 842 | 21.6% |
| Value | Count | Frequency (%) | |
| 5 | 606 | 15.5% | |
| 4 | 842 | 21.6% | |
| 3 | 825 | 21.2% | |
| 2 | 815 | 20.9% | |
| 1 | 639 | 16.4% |
Location
Real number (ℝ≥0)
| Distinct count | 5 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.012051282051282 |
|---|---|
| Minimum | 1 |
| Maximum | 5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 30.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 4 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.298136608 |
|---|---|
| Coefficient of variation (CV) | 0.4309809119 |
| Kurtosis | -1.071411 |
| Mean | 3.012051282 |
| Median Absolute Deviation (MAD) | 1.057353189 |
| Skewness | -0.08076922033 |
| Sum | 11747 |
| Variance | 1.685158653 |
| Value | Count | Frequency (%) | |
| 3 | 1016 | 26.1% | |
| 4 | 934 | 23.9% | |
| 2 | 721 | 18.5% | |
| 1 | 656 | 16.8% | |
| 5 | 573 | 14.7% |
| Value | Count | Frequency (%) | |
| 1 | 656 | 16.8% | |
| 2 | 721 | 18.5% | |
| 3 | 1016 | 26.1% | |
| 4 | 934 | 23.9% | |
| 5 | 573 | 14.7% |
| Value | Count | Frequency (%) | |
| 5 | 573 | 14.7% | |
| 4 | 934 | 23.9% | |
| 3 | 1016 | 26.1% | |
| 2 | 721 | 18.5% | |
| 1 | 656 | 16.8% |
Wifi
Real number (ℝ≥0)
| Distinct count | 6 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.2666666666666666 |
|---|---|
| Minimum | 0 |
| Maximum | 5 |
| Zeros | 6 |
| Zeros (%) | 0.2% |
| Memory size | 30.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.319518929 |
|---|---|
| Coefficient of variation (CV) | 0.4039343661 |
| Kurtosis | -1.127038831 |
| Mean | 3.266666667 |
| Median Absolute Deviation (MAD) | 1.146974359 |
| Skewness | -0.1890824049 |
| Sum | 12740 |
| Variance | 1.741130204 |
| Value | Count | Frequency (%) | |
| 4 | 925 | 23.7% | |
| 5 | 899 | 23.1% | |
| 2 | 839 | 21.5% | |
| 3 | 818 | 21.0% | |
| 1 | 413 | 10.6% | |
| 0 | 6 | 0.2% |
| Value | Count | Frequency (%) | |
| 0 | 6 | 0.2% | |
| 1 | 413 | 10.6% | |
| 2 | 839 | 21.5% | |
| 3 | 818 | 21.0% | |
| 4 | 925 | 23.7% |
| Value | Count | Frequency (%) | |
| 5 | 899 | 23.1% | |
| 4 | 925 | 23.7% | |
| 3 | 818 | 21.0% | |
| 2 | 839 | 21.5% | |
| 1 | 413 | 10.6% |
| Distinct count | 6 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.384102564102564 |
|---|---|
| Minimum | 0 |
| Maximum | 5 |
| Zeros | 84 |
| Zeros (%) | 2.2% |
| Memory size | 30.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.335357202 |
|---|---|
| Coefficient of variation (CV) | 0.3945971425 |
| Kurtosis | -0.5402427337 |
| Mean | 3.384102564 |
| Median Absolute Deviation (MAD) | 1.133547666 |
| Skewness | -0.5970774815 |
| Sum | 13198 |
| Variance | 1.783178856 |
| Value | Count | Frequency (%) | |
| 4 | 1288 | 33.0% | |
| 5 | 877 | 22.5% | |
| 3 | 701 | 18.0% | |
| 2 | 608 | 15.6% | |
| 1 | 342 | 8.8% | |
| 0 | 84 | 2.2% |
| Value | Count | Frequency (%) | |
| 0 | 84 | 2.2% | |
| 1 | 342 | 8.8% | |
| 2 | 608 | 15.6% | |
| 3 | 701 | 18.0% | |
| 4 | 1288 | 33.0% |
| Value | Count | Frequency (%) | |
| 5 | 877 | 22.5% | |
| 4 | 1288 | 33.0% | |
| 3 | 701 | 18.0% | |
| 2 | 608 | 15.6% | |
| 1 | 342 | 8.8% |
Gym
Real number (ℝ≥0)
| Distinct count | 5 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.5264102564102564 |
|---|---|
| Minimum | 1 |
| Maximum | 5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 30.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 4 |
| Q3 | 5 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 4 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.291584987 |
|---|---|
| Coefficient of variation (CV) | 0.3662605576 |
| Kurtosis | -0.7997210228 |
| Mean | 3.526410256 |
| Median Absolute Deviation (MAD) | 1.104800131 |
| Skewness | -0.5678137875 |
| Sum | 13753 |
| Variance | 1.668191778 |
| Value | Count | Frequency (%) | |
| 4 | 1257 | 32.2% | |
| 5 | 1058 | 27.1% | |
| 3 | 653 | 16.7% | |
| 2 | 544 | 13.9% | |
| 1 | 388 | 9.9% |
| Value | Count | Frequency (%) | |
| 1 | 388 | 9.9% | |
| 2 | 544 | 13.9% | |
| 3 | 653 | 16.7% | |
| 4 | 1257 | 32.2% | |
| 5 | 1058 | 27.1% |
| Value | Count | Frequency (%) | |
| 5 | 1058 | 27.1% | |
| 4 | 1257 | 32.2% | |
| 3 | 653 | 16.7% | |
| 2 | 544 | 13.9% | |
| 1 | 388 | 9.9% |
Spa
Real number (ℝ≥0)
| Distinct count | 6 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.4812820512820513 |
|---|---|
| Minimum | 0 |
| Maximum | 5 |
| Zeros | 2 |
| Zeros (%) | 0.1% |
| Memory size | 30.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 5 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.292116447 |
|---|---|
| Coefficient of variation (CV) | 0.3711610917 |
| Kurtosis | -0.8956475122 |
| Mean | 3.481282051 |
| Median Absolute Deviation (MAD) | 1.116219855 |
| Skewness | -0.4853974402 |
| Sum | 13577 |
| Variance | 1.669564911 |
| Value | Count | Frequency (%) | |
| 4 | 1201 | 30.8% | |
| 5 | 1023 | 26.2% | |
| 3 | 679 | 17.4% | |
| 2 | 626 | 16.1% | |
| 1 | 369 | 9.5% | |
| 0 | 2 | 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 2 | 0.1% | |
| 1 | 369 | 9.5% | |
| 2 | 626 | 16.1% | |
| 3 | 679 | 17.4% | |
| 4 | 1201 | 30.8% |
| Value | Count | Frequency (%) | |
| 5 | 1023 | 26.2% | |
| 4 | 1201 | 30.8% | |
| 3 | 679 | 17.4% | |
| 2 | 626 | 16.1% | |
| 1 | 369 | 9.5% |
Staff
Real number (ℝ≥0)
| Distinct count | 6 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.483076923076923 |
|---|---|
| Minimum | 0 |
| Maximum | 5 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 30.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 4 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.257333765 |
|---|---|
| Coefficient of variation (CV) | 0.3609836339 |
| Kurtosis | -0.7329475055 |
| Mean | 3.483076923 |
| Median Absolute Deviation (MAD) | 1.070253254 |
| Skewness | -0.5198169209 |
| Sum | 13584 |
| Variance | 1.580888196 |
| Value | Count | Frequency (%) | |
| 4 | 1229 | 31.5% | |
| 5 | 957 | 24.5% | |
| 3 | 833 | 21.4% | |
| 2 | 504 | 12.9% | |
| 1 | 376 | 9.6% | |
| 0 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 1 | 376 | 9.6% | |
| 2 | 504 | 12.9% | |
| 3 | 833 | 21.4% | |
| 4 | 1229 | 31.5% |
| Value | Count | Frequency (%) | |
| 5 | 957 | 24.5% | |
| 4 | 1229 | 31.5% | |
| 3 | 833 | 21.4% | |
| 2 | 504 | 12.9% | |
| 1 | 376 | 9.6% |
Pool
Real number (ℝ≥0)
| Distinct count | 6 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.4951282051282053 |
|---|---|
| Minimum | 0 |
| Maximum | 5 |
| Zeros | 10 |
| Zeros (%) | 0.3% |
| Memory size | 30.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 5 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.282480788 |
|---|---|
| Coefficient of variation (CV) | 0.3669338328 |
| Kurtosis | -0.8390520795 |
| Mean | 3.495128205 |
| Median Absolute Deviation (MAD) | 1.106606969 |
| Skewness | -0.4968268433 |
| Sum | 13631 |
| Variance | 1.644756973 |
| Value | Count | Frequency (%) | |
| 4 | 1204 | 30.9% | |
| 5 | 1030 | 26.4% | |
| 3 | 679 | 17.4% | |
| 2 | 651 | 16.7% | |
| 1 | 326 | 8.4% | |
| 0 | 10 | 0.3% |
| Value | Count | Frequency (%) | |
| 0 | 10 | 0.3% | |
| 1 | 326 | 8.4% | |
| 2 | 651 | 16.7% | |
| 3 | 679 | 17.4% | |
| 4 | 1204 | 30.9% |
| Value | Count | Frequency (%) | |
| 5 | 1030 | 26.4% | |
| 4 | 1204 | 30.9% | |
| 3 | 679 | 17.4% | |
| 2 | 651 | 16.7% | |
| 1 | 326 | 8.4% |
Baggage_Handling
Real number (ℝ≥0)
| Distinct count | 5 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.6956410256410255 |
|---|---|
| Minimum | 1 |
| Maximum | 5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 30.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 4 |
| Q3 | 5 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 4 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.142986676 |
|---|---|
| Coefficient of variation (CV) | 0.3092796807 |
| Kurtosis | -0.2663876412 |
| Mean | 3.695641026 |
| Median Absolute Deviation (MAD) | 0.9367589744 |
| Skewness | -0.7097484223 |
| Sum | 14413 |
| Variance | 1.306418543 |
| Value | Count | Frequency (%) | |
| 4 | 1429 | 36.6% | |
| 5 | 1067 | 27.4% | |
| 3 | 771 | 19.8% | |
| 2 | 416 | 10.7% | |
| 1 | 217 | 5.6% |
| Value | Count | Frequency (%) | |
| 1 | 217 | 5.6% | |
| 2 | 416 | 10.7% | |
| 3 | 771 | 19.8% | |
| 4 | 1429 | 36.6% | |
| 5 | 1067 | 27.4% |
| Value | Count | Frequency (%) | |
| 5 | 1067 | 27.4% | |
| 4 | 1429 | 36.6% | |
| 3 | 771 | 19.8% | |
| 2 | 416 | 10.7% | |
| 1 | 217 | 5.6% |
Reception
Real number (ℝ≥0)
| Distinct count | 5 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.337948717948718 |
|---|---|
| Minimum | 1 |
| Maximum | 5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 30.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 4 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.245312688 |
|---|---|
| Coefficient of variation (CV) | 0.3730772379 |
| Kurtosis | -0.7437995727 |
| Mean | 3.337948718 |
| Median Absolute Deviation (MAD) | 1.042034188 |
| Skewness | -0.3947413292 |
| Sum | 13018 |
| Variance | 1.550803691 |
| Value | Count | Frequency (%) | |
| 3 | 1116 | 28.6% | |
| 4 | 1101 | 28.2% | |
| 5 | 784 | 20.1% | |
| 1 | 452 | 11.6% | |
| 2 | 447 | 11.5% |
| Value | Count | Frequency (%) | |
| 1 | 452 | 11.6% | |
| 2 | 447 | 11.5% | |
| 3 | 1116 | 28.6% | |
| 4 | 1101 | 28.2% | |
| 5 | 784 | 20.1% |
| Value | Count | Frequency (%) | |
| 5 | 784 | 20.1% | |
| 4 | 1101 | 28.2% | |
| 3 | 1116 | 28.6% | |
| 2 | 447 | 11.5% | |
| 1 | 452 | 11.6% |
Cleanliness
Real number (ℝ≥0)
| Distinct count | 6 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.697948717948718 |
|---|---|
| Minimum | 0 |
| Maximum | 5 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 30.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 4 |
| Q3 | 5 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.13402727 |
|---|---|
| Coefficient of variation (CV) | 0.3066638713 |
| Kurtosis | -0.23456247 |
| Mean | 3.697948718 |
| Median Absolute Deviation (MAD) | 0.9271008547 |
| Skewness | -0.7148839475 |
| Sum | 14422 |
| Variance | 1.286017848 |
| Value | Count | Frequency (%) | |
| 4 | 1459 | 37.4% | |
| 5 | 1050 | 26.9% | |
| 3 | 762 | 19.5% | |
| 2 | 422 | 10.8% | |
| 1 | 206 | 5.3% | |
| 0 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 1 | 206 | 5.3% | |
| 2 | 422 | 10.8% | |
| 3 | 762 | 19.5% | |
| 4 | 1459 | 37.4% |
| Value | Count | Frequency (%) | |
| 5 | 1050 | 26.9% | |
| 4 | 1459 | 37.4% | |
| 3 | 762 | 19.5% | |
| 2 | 422 | 10.8% | |
| 1 | 206 | 5.3% |
Online_Booking
Real number (ℝ≥0)
| Distinct count | 6 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.345128205128205 |
|---|---|
| Minimum | 0 |
| Maximum | 5 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 30.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.276924186 |
|---|---|
| Coefficient of variation (CV) | 0.3817265312 |
| Kurtosis | -0.8834224102 |
| Mean | 3.345128205 |
| Median Absolute Deviation (MAD) | 1.086421565 |
| Skewness | -0.3656400321 |
| Sum | 13046 |
| Variance | 1.630535377 |
| Value | Count | Frequency (%) | |
| 4 | 1082 | 27.7% | |
| 3 | 971 | 24.9% | |
| 5 | 852 | 21.8% | |
| 2 | 551 | 14.1% | |
| 1 | 443 | 11.4% | |
| 0 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 1 | 443 | 11.4% | |
| 2 | 551 | 14.1% | |
| 3 | 971 | 24.9% | |
| 4 | 1082 | 27.7% |
| Value | Count | Frequency (%) | |
| 5 | 852 | 21.8% | |
| 4 | 1082 | 27.7% | |
| 3 | 971 | 24.9% | |
| 2 | 551 | 14.1% | |
| 1 | 443 | 11.4% |
| Distinct count | 182 |
|---|---|
| Unique (%) | 4.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14.550256410256411 |
|---|---|
| Minimum | 0 |
| Maximum | 569 |
| Zeros | 2167 |
| Zeros (%) | 55.6% |
| Memory size | 30.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 13 |
| 95-th percentile | 74.05 |
| Maximum | 569 |
| Range | 569 |
| Interquartile range (IQR) | 13 |
Descriptive statistics
| Standard deviation | 36.53200823 |
|---|---|
| Coefficient of variation (CV) | 2.510746697 |
| Kurtosis | 39.99143583 |
| Mean | 14.55025641 |
| Median Absolute Deviation (MAD) | 19.77595556 |
| Skewness | 5.22328422 |
| Sum | 56746 |
| Variance | 1334.587625 |
| Value | Count | Frequency (%) | |
| 0 | 2167 | 55.6% | |
| 1 | 124 | 3.2% | |
| 2 | 91 | 2.3% | |
| 3 | 73 | 1.9% | |
| 4 | 71 | 1.8% | |
| 5 | 68 | 1.7% | |
| 6 | 65 | 1.7% | |
| 7 | 60 | 1.5% | |
| 16 | 45 | 1.2% | |
| 10 | 44 | 1.1% | |
| Other values (172) | 1092 | 28.0% |
| Value | Count | Frequency (%) | |
| 0 | 2167 | 55.6% | |
| 1 | 124 | 3.2% | |
| 2 | 91 | 2.3% | |
| 3 | 73 | 1.9% | |
| 4 | 71 | 1.8% |
| Value | Count | Frequency (%) | |
| 569 | 1 | < 0.1% | |
| 415 | 1 | < 0.1% | |
| 358 | 1 | < 0.1% | |
| 351 | 1 | < 0.1% | |
| 341 | 1 | < 0.1% |
| Distinct count | 190 |
|---|---|
| Unique (%) | 4.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.4865641025641028 |
|---|---|
| Minimum | 0.0 |
| Maximum | 54.3 |
| Zeros | 2158 |
| Zeros (%) | 55.3% |
| Memory size | 30.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1.3 |
| 95-th percentile | 7.305 |
| Maximum | 54.3 |
| Range | 54.3 |
| Interquartile range (IQR) | 1.3 |
Descriptive statistics
| Standard deviation | 3.684135386 |
|---|---|
| Coefficient of variation (CV) | 2.478288948 |
| Kurtosis | 37.40153856 |
| Mean | 1.486564103 |
| Median Absolute Deviation (MAD) | 2.009098277 |
| Skewness | 5.116669975 |
| Sum | 5797.6 |
| Variance | 13.57285354 |
| Value | Count | Frequency (%) | |
| 0 | 2158 | 55.3% | |
| 0.1 | 100 | 2.6% | |
| 0.4 | 81 | 2.1% | |
| 0.2 | 80 | 2.1% | |
| 0.3 | 80 | 2.1% | |
| 0.5 | 64 | 1.6% | |
| 0.6 | 58 | 1.5% | |
| 0.7 | 56 | 1.4% | |
| 0.8 | 55 | 1.4% | |
| 0.9 | 52 | 1.3% | |
| Other values (180) | 1116 | 28.6% |
| Value | Count | Frequency (%) | |
| 0 | 2158 | 55.3% | |
| 0.1 | 100 | 2.6% | |
| 0.2 | 80 | 2.1% | |
| 0.3 | 80 | 2.1% | |
| 0.4 | 81 | 2.1% |
| Value | Count | Frequency (%) | |
| 54.3 | 1 | < 0.1% | |
| 41 | 1 | < 0.1% | |
| 35.7 | 1 | < 0.1% | |
| 35.2 | 1 | < 0.1% | |
| 34.9 | 1 | < 0.1% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
First rows
| Guest_ID | Gender | Frequent_Traveler | Age | Type | Flight_Class | Points | Room | Check-in/Check-out | F&B | Location | Wifi | Entertainment | Gym | Spa | Staff | Pool | Baggage_Handling | Reception | Cleanliness | Online_Booking | Deposit_Kept | Time_Room_Service | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 19847 | Female | 0 | 38 | Business travel | Eco | 2097 | 3 | 3 | 3 | 4 | 3 | 1 | 1 | 1 | 1 | 3 | 3 | 1 | 4 | 1 | 112 | 13.5 |
| 1 | 12433 | Female | 1 | 46 | Business travel | Business | 1629 | 3 | 3 | 3 | 3 | 2 | 5 | 4 | 4 | 4 | 4 | 4 | 5 | 4 | 3 | 0 | 0.0 |
| 2 | 10273 | Male | 1 | 33 | Business travel | Business | 1615 | 5 | 5 | 5 | 5 | 4 | 4 | 4 | 4 | 5 | 3 | 3 | 3 | 2 | 4 | 0 | 0.0 |
| 3 | 12457 | Male | 0 | 38 | Business travel | Eco | 1520 | 3 | 3 | 3 | 4 | 2 | 3 | 2 | 2 | 3 | 3 | 4 | 4 | 5 | 2 | 0 | 0.0 |
| 4 | 22903 | Female | 0 | 27 | Business travel | Business | 3524 | 3 | 3 | 3 | 4 | 2 | 3 | 2 | 2 | 4 | 3 | 5 | 3 | 5 | 2 | 10 | 0.0 |
| 5 | 22449 | Male | 1 | 37 | Personal Travel | Eco | 3192 | 2 | 4 | 2 | 5 | 1 | 2 | 1 | 1 | 4 | 4 | 4 | 5 | 5 | 1 | 0 | 0.1 |
| 6 | 14787 | Male | 0 | 41 | Business travel | Business | 1518 | 5 | 5 | 5 | 2 | 1 | 5 | 1 | 1 | 5 | 3 | 5 | 5 | 5 | 1 | 0 | 0.0 |
| 7 | 15158 | Male | 1 | 32 | Business travel | Business | 1388 | 5 | 5 | 5 | 5 | 5 | 5 | 4 | 5 | 4 | 2 | 5 | 4 | 5 | 5 | 24 | 2.4 |
| 8 | 22185 | Female | 0 | 24 | Business travel | Business | 1796 | 5 | 2 | 5 | 3 | 5 | 1 | 1 | 4 | 3 | 5 | 4 | 1 | 3 | 1 | 0 | 0.0 |
| 9 | 14633 | Male | 1 | 31 | Business travel | Business | 2184 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 4 | 2 | 5 | 4 | 4 | 2 | 3 | 0.0 |
Last rows
| Guest_ID | Gender | Frequent_Traveler | Age | Type | Flight_Class | Points | Room | Check-in/Check-out | F&B | Location | Wifi | Entertainment | Gym | Spa | Staff | Pool | Baggage_Handling | Reception | Cleanliness | Online_Booking | Deposit_Kept | Time_Room_Service | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 3890 | 20327 | Male | 0 | 33 | Business travel | Eco | 2146 | 4 | 3 | 4 | 3 | 5 | 4 | 2 | 5 | 4 | 4 | 1 | 2 | 2 | 5 | 40 | 3.0 |
| 3891 | 14785 | Male | 1 | 72 | Business travel | Business | 3854 | 3 | 2 | 2 | 2 | 5 | 3 | 4 | 3 | 3 | 3 | 3 | 4 | 3 | 3 | 0 | 0.0 |
| 3892 | 22341 | Female | 1 | 13 | Personal Travel | Eco | 3453 | 5 | 4 | 5 | 2 | 1 | 5 | 1 | 1 | 1 | 4 | 4 | 4 | 2 | 1 | 0 | 0.0 |
| 3893 | 20010 | Female | 1 | 27 | Personal Travel | Eco | 1734 | 1 | 1 | 1 | 1 | 4 | 4 | 4 | 4 | 4 | 3 | 5 | 3 | 4 | 4 | 5 | 0.0 |
| 3894 | 17139 | Male | 1 | 56 | Business travel | Eco | 1911 | 3 | 5 | 5 | 5 | 4 | 3 | 4 | 4 | 2 | 4 | 1 | 1 | 1 | 4 | 70 | 10.7 |
| 3895 | 18266 | Female | 1 | 34 | Business travel | Business | 3532 | 5 | 5 | 5 | 5 | 3 | 3 | 3 | 5 | 3 | 5 | 5 | 3 | 5 | 3 | 273 | 30.8 |
| 3896 | 21243 | Female | 1 | 13 | Personal Travel | Eco | 1701 | 5 | 5 | 1 | 5 | 5 | 3 | 4 | 4 | 4 | 4 | 4 | 5 | 4 | 2 | 0 | 0.0 |
| 3897 | 19539 | Female | 1 | 17 | Personal Travel | Eco | 1643 | 4 | 5 | 4 | 3 | 1 | 4 | 4 | 1 | 5 | 2 | 4 | 3 | 4 | 1 | 0 | 3.2 |
| 3898 | 15253 | Male | 1 | 23 | Personal Travel | Eco | 2721 | 3 | 2 | 3 | 3 | 3 | 4 | 4 | 2 | 2 | 3 | 3 | 4 | 4 | 4 | 152 | 16.4 |
| 3899 | 22708 | Female | 1 | 52 | Business travel | Business | 218 | 0 | 5 | 0 | 2 | 5 | 5 | 4 | 5 | 5 | 5 | 5 | 4 | 5 | 4 | 0 | 0.3 |